NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Dense-Sparse Dynamic Time Warping for Customizing Piano Concerto Accompaniments

https://doi.org/10.1109/ICASSP49660.2025.10890080

Tsai, TJ; Dey, Kavi; Özer, Yigitcan; Müller, Meinard (April 2025, IEEE)

Free, publicly-accessible full text available April 6, 2026
PBSCR: The Piano Bootleg Score Composer Recognition Dataset

https://doi.org/10.5334/tismir.185

Jain, Arhan; Bunn, Alec; Pham, Austin; Tsai, TJ (September 2024, Transactions of the International Society for Music Information Retrieval)

Full Text Available
Improving the Robustness of DTW to Global Time Warping Conditions in Audio Synchronization

https://doi.org/10.3390/app14041459

Kraprayoon, Jittisa; Pham, Austin; Tsai, TJ (February 2024, Applied Sciences)

Dynamic time warping estimates the alignment between two sequences and is designed to handle a variable amount of time warping. In many contexts, it performs poorly when confronted with two sequences of different scale, in which the average slope of the true alignment path in the pairwise cost matrix deviates significantly from one. This paper investigates ways to improve the robustness of DTW to such global time warping conditions, using an audio–audio alignment task as a motivating scenario of interest. We modify a dataset commonly used for studying audio–audio synchronization in order to construct a benchmark in which the global time warping conditions are carefully controlled, and we evaluate the effectiveness of several strategies designed to handle global time warping. Among the strategies tested, there is a clear winner: performing sequence length normalization via downsampling before invoking DTW. This method achieves the best alignment accuracy across a wide range of global time warping conditions, and it maintains or reduces the runtime compared to standard usages of DTW. We present experiments and analyses to demonstrate its effectiveness in both controlled and realistic scenarios.
more » « less
Full Text Available
FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions

https://doi.org/10.5281/zenodo.10265392

Bükey, Irmak; Zhang, Jason; Tsai, TJ (November 2023, ISMIR)

Alignment algorithms like DTW and subsequence DTW assume specific boundary conditions on where an alignment path can begin and end in the cost matrix. In practice, the boundary conditions may not be known a priori or may not satisfy such strict assumptions. This paper introduces an alignment algorithm called FlexDTW that is designed to handle a wide range of boundary conditions. FlexDTW allows alignment paths to start anywhere on the bottom or left edge of the cost matrix (adjacent to the origin) and to end anywhere on the top or right edge. In order to properly compare paths of very different lengths, we use a goodness measure that normalizes the cumulative path cost by the path length. The key insight of FlexDTW is that the Manhattan length of a path can be computed by simply knowing the starting point of the path, which can be computed recursively during dynamic programming. We artificially generate a suite of 16 benchmarks based on the Chopin Mazurka dataset in order to characterize audio alignment performance under a variety of boundary conditions. We show that FlexDTW has consistently strong performance that is comparable or better than commonly used alignment algorithms, and it is the only system with strong performance in some boundary conditions.
more » « less
Full Text Available
Audio Cross Verification Using Dual Alignment Likelihood Ratio Test

https://doi.org/10.1109/icassp49357.2023.10095059

Lei, Heidi; Wonghirundacha, Arm; Bukey, Irmak; Tsai, TJ (January 2023, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Full Text Available
Segmental DTW: A Parallelizable Alternative to Dynamic Time Warping

https://doi.org/10.1109/icassp39728.2021.9413827

Tsai, TJ (May 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Full Text Available

Search for: All records